首页> 外文OA文献 >Advanced text authorship detection methods and their application to biblical texts
【2h】

Advanced text authorship detection methods and their application to biblical texts

机译:高级文本作者身份检测方法及其在圣经文本中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Authorship attribution has a range of applications in a growing number of fields such as forensic evidence, plagiarism detection, email filtering, and web information management. In this study, three attribution techniques are extended, tested on a corpus of English texts, and applied to a book in the New Testament of disputed authorship. The word recurrence interval method compares standard deviations of the number of words between successive occurrences of a keyword both graphically and with chi-squared tests. The trigram Markov method compares the probabilities of the occurrence of words conditional on the preceding two words to determine the similarity between texts. The third method extracts stylometric measures such as the frequency of occurrence of function words and from these constructs text classification models using multiple discriminant analysis. The effectiveness of these techniques is compared. The accuracy of the results obtained by some of these extended methods is higher than many of the current state of the art approaches. Statistical evidence is presented about the authorship of the selected book from the New Testament.
机译:作者身份归属在法医证据,抄袭检测,电子邮件过滤和Web信息管理等越来越多的领域中都有广泛的应用。在这项研究中,扩展了三种归因技术,在一组英语文本上进行了测试,并将其应用于《新约》中有争议的作者身份的一本书。单词重复间隔方法比较了图形和卡方检验在关键字连续出现之间的单词数量标准差。 Trigram Markov方法比较以前两个单词为条件的单词出现的概率,以确定文本之间的相似性。第三种方法使用多重判别分析从这些构造文本分类模型中提取诸如功能词出现频率之类的风格度量。比较了这些技术的有效性。通过这些扩展方法中的某些方法获得的结果的准确性高于许多当前最新技术水平。提供了有关新约中所选书的作者身份的统计证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号